A Lemmatization Method for Modern Mongolian and its Application to Information Retrieval

نویسندگان

  • Badam-Osor Khaltar
  • Atsushi Fujii
چکیده

In Modern Mongolian, a content word can be inflected when concatenated with suffixes. Identifying the original forms of content words is crucial for natural language processing and information retrieval. We propose a lemmatization method for Modern Mongolian and apply our method to indexing for information retrieval. We use technical abstracts to show the effectiveness of our method experimentally.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Enhancing Lemmatization for Mongolian and its Application to Statistical Machine Translation

Lemmatization is crucial in natural language processing and information retrieval especially for highly inflected languages, such as Finnish and Mongolian. The state-of-the-art method of lemmatization for Mongolian does not need a noun dictionary and is scalable, but errors of this method are mainly caused by problems related to part of speech (POS) information. To resolve this problem, we inte...

متن کامل

Research on Reasoning and Retrieval Methods Based on Mongolian Curriculum Areas of Semantic Web

The backwardness of the Mongolian network teaching resources results in its low reuse rates and utilization. For this situation, a retrieval method of semantic web based on Mongolian curriculum areas was set up. Firstly, the method established the Mongolian ontology of course ‘Artificial Intelligence ( )’in area of teaching, it uses a relationship database MySQL to record ontology information, ...

متن کامل

Performance Evaluation of Medical Image Retrieval Systems Based on a Systematic Review of the Current Literature

Background and Aim: Image, as a kind of information vehicle which can convey a large volume of information, is important especially in medicine field. Existence of different attributes of image features and various search algorithms in medical image retrieval systems and lack of an authority to evaluate the quality of retrieval systems, make a systematic review in medical image retrieval system...

متن کامل

Design and Realization of Mongolian Syntactic Retrieval System Based on Dependency Treebank

In the past seven years, Language Research Institute of Inner Mongolia University has constructed a 500,000word scale Mongolian dependency treebank. The syntactic treebank provides a favorable data platform for language research and information processing. In order to effectively use the treebank, we have designed and implemented a graphical syntactic information retrieval system based on the M...

متن کامل

Testing and Validating the Role of Interactive Information Retrieval Model in Faculty Members' psychological Enabling: A Case Study of Alborz University of Medical Sciences

The term "electromagnetic fields" (EMF) is a combination of electric and magnetic fields as a diagnostic method as well as a therapeutic tool with many advantages such as ease of operation and painlessness, very controllable, which today has found wide application in regenerative medicine and also cancer treatment.  In addition to organs such as nerves, hearts, and bones that have an electrica...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008